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Abstract: In this paper we obtain a Bernstein type inequality for a class 
of weakly dependent and bounded random variables. The proofs lead to a 
moderate deviations principle for sums of bounded random variables with ex- 
ponential decay of the strong mixing coefficients that complements the large 
deviation result obtained by Bryc and Dcmbo (1998) under superexponential 
mixing rates. 



1. Introduction 

This paper has double scope. First we obtain a Bernstein's type bound on the 
tail probabilities of the partial sums S n of a sequence of dependent and bounded 
random variables (Xk,k > 1). Then we use the developed techniques to study the 
moderate deviations principle. 

We recall the definition of strongly mixing sequences, introduced by Rosenblatt 
(1956): For any two er-algcbras A and B, we define the a-mixing coefficient by 

a{A,B)= sup \V(Af)B) -V{A)F(B)\ . 

AgA,BgB 

Let (Xk, k > 1) be a sequence of real- valued random variables defined on (f2, A, P). 
This sequence will be called strongly mixing if 

(1.1) a(n) := sup a (A4k, Gk+n) — > as n — > oo , 

fc>i 

where Mj := cr(Xi, i < j) and Qj := <r(Xi, i > j) for j > 1. 
Alternatively (see Bradley, 2007, Theorem 4.4) 

(1.2) 4a(n) ^suplCovC/^VH/IUHglU; /6L M (^), jeL x ,(& + „)}. 

Establishing exponential inequalities for strongly mixing sequences is a very chal- 
lenging problem. Some steps in this direction are results by Rio (2000, Theorem 
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6.1), who obtained a Fuk-Nagaev type inequality, by Dedecker and Prieur (2004) 
who extended Theorem 6.1 in Rio (2000) using coupling coefficients, by Doukhan 
and Neumann (2005) who used combinatorics techniques. In a recent paper Mer- 
levede, Peligrad and Rio (2009) get exponential bounds for subexponcntial mix- 
ing rates, when the variables are not necessarily bounded, obtaining the same or- 
der of magnitude as in the independent case. More precisely they show that, if 
a{n) < cxp(-cn 71 ) and sup l>0 P(|A 4 | > t) < exp(l - f 2 ) with 71 > and 72 > 0, 
such that (I/71) + (I/72) = I/7 > lj then there are positive constants C, C±, C2 
and J] depending only on c, 71 and 72, such that for all n > 4 and A > C(logn) r ' 

H\S n \ > A) < (n + 1) cxp(-AVCi) + cxp(-A 2 /nC 2 ) . 

Here S n = Y^u—i -Xfc- The case not covered by that paper is the case of exponential 
mixing rates and bounded variables, that is 71 = 1 and 72 = 00. The aim of this 
paper is to study this case, to point out several new recent techniques and ideas 
and to comment on the order of magnitude of the probabilities of large deviations. 
Our proofs will be based on estimations of the Laplace transform. 

One of our results is that for a strongly mixing sequence of centered and bounded 
random variables satisfying, for a certain c > 0, 

(1.3) a(n) < exp(— 2cn) , 

we can find two constants c\ and ci depending only on c and on the uniform bound 
of the random variables, such that, for all x > 0, 

P(|5 n | > x) < exp(-cix 2 /n) + exp(-c 2 a;/(log77,)(loglogn)) . 

Then, we use this exponential inequality and the techniques that lead to this re- 
sult to obtain moderate deviations asymptotic results, improving Proposition 2.4 
in Merlevede and Peligrad (2009). Our results show that we can come close up to a 
logarithmic term to the moderate deviations asymptotics for independent random 
variables. Of course a kind of correction is needed since the traditional large devia- 
tions results do not hold for geometrically strongly mixing sequences. As a matter 
of fact the large deviations principle does not hold even in the context of uniformly 
mixing sequences with exponential rates. See Bryc and Dembo (1996), Example 
1, Proposition 5 and Example 2, that point out examples of empirical processes 
of Doeblin recurrent Markov chains, that are therefore </>-mixing with exponential 
mixing rate and that do not satisfy the large deviations principle. 

We mention that strongly mixing sequences form a larger class than absolutely 
regular sequences. The problem of the moderate deviations principle was studied 
for absolutely regular Markov chains with exponential rates in de Acosta (1997) and 
also in Chen and de Acosta (1998), when the transition probabilities are stationary 
and there is a certain restriction on the class of initial distributions. A class of pro- 
cesses satisfying a splitting condition closely related to absolutely regular processes 
was considered by Tsirelson (2008). Recently, Dedecker, Merlevede, Peligrad and 
Utev (2009) considered projective conditions with applications to <fi- mixing. 

Notice that we do not require any degree of stationarity for obtaining Bernstein 
inequality except for a uniform bound for the variables. 

The strong mixing coefficient used in this paper can be generalized by using 
smaller classes of functions than those used in Definition (1.2) to include even more 
examples. Such examples include function of linear processes with absolutely regular 
innovations and Arch models. In this paper, we give an application to the moderate 
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deviations principle for Kernel estimators of the common marginal density of a 
certain class of continuous time processes. 

For the clarity of the proofs it will be more convenient to embed the initial 
sequence into a continuous time process; namely, {X tl t > 0) is defined from the 
original sequence (X n ,n > 1) by X t = Xr t+1 i. For a Borcl set A, define 

(1.4) S A = f X t dt. 

J A 

Then Si 0jn ] = J22=i The strong mixing coefficient of (X t ,t > 0) is defined as 

a(u) := sup a (A4t,Gt+u) — > as n — > oo , 
t>o 

where Ait '■= o~(X v , v < t) and Q w := a(X v , v > w) . 

Notice that, since a(u) < a([u]), if (X n ,n > 1) satisfies (1.3), the continuous 
type mixing coefficients still satisfy a geometrically mixing condition; namely for 
any u > 2, 

(1.5) a{u) < exp(-cu) . 

In the rest of the paper, ||V||<xj stands for the essential supremum of a random 
variable Y. 



2. Results 

Our first result is the following exponential inequality: 

Theorem 1. Let (Xj)j>% be a sequence of centered real-valued random variables. 
Suppose that the sequence satisfies (1.3) and that there exists a positive M such 
that sup 4>1 Halloo < M. Then there are positive constants C\ and C2 depending 

only on c such that for all n > 4 and t satisfying < t < — — — : , 

Ci A/ (log n)(log log n) 

we have 

, , r, ^^ C 2 t 2 nM 2 

logE(cxp(tS* Il )) < „ ,, rn rr, : r , 

v ' 1 — CitM(logn)(loglogn) 

In terms of probabilities, there is a constant C3 depending on c such that for all 
n > 4 and x > 

(2.1) n\S n \ > x) < exp ( - nM2 + M ,ggl )(lQglogn) ) ■ 

As a counterpart, the following Bernstein type inequality holds. 

Theorem 2. Under conditions of Theorem 1, there are positive constants C\ and 
C2 depending only on c such that for all n > 2 and any positive t such that t < 
ClM (\ „ n yi , the following inequality holds: 

, , „ ^ C 2 t 2 (nv 2 + M 2 ) 

logE exp (tS n )) < 2 J ^ , 

1 — C\tM (log ny 

where v 2 is defined by 

(2.2) v 2 = sup(var(X 4 ) + 2 ^ \Cov(X l7 X 3 



i>0 



j>i 
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In terms of probabilities, there is a constant C3 depending only on c such that for 
all n > 2, 



( 2 - 3 ) P(|S„|>a;)<exp(-- aJ^' „ r 

v ; Vl l_ '- v v 2 n + M 2 +xM(\ogn) 

To compare these two results, we notice that the coefficient of x in the inequality 
(2.1) has a smaller order than the corresponding in (2.3). However, the term v 2 n 
can be considerably smaller than nM 2 , which is an advantage in some applications 
when the variables are not uniformly bounded. Notice also that if stationarity is 
assumed, v 2 can be taken as 

v 2 = Var(X!)+4y EXflQX^ > Q(2a 4 )) , 

where Q(u) = ia£{t > 0,P(|Xi| > t) < u} for u in ]0, 1]. 

In the context of bounded functions / of stationary geometrically strongly mixing 
Markov chains, Theorem 6 in Adamczak (2008) provides a Bernstein type inequality 
for S n (f) = f{Xi) + • • • + f(X n ) with the factor logn instead of (logn) 2 , which 
appears in (2.3). To be more precise, under the centering condition E(/(Xl)) = 0, 
he proves that 

P(|S n (/)|>aO<Cexp(-^min( 
where a 2 = lim„ n _1 Var5 n (/). 



na 2 ' log n 



The two previous results are useful to study the moderate deviations principle 
(MDP) for the partial sums of the underlying sequences. In our terminology the 
moderate deviations principle (MDP) signifies the following type of behavior. 

Definition 3. We say that the MDP holds for a sequence (T n ) n of random vari- 
ables with the speed a n — > and rate function I(t) if for each Borel set A, 

- inf I(t) < liminf a n log f(Ja^T n e A) 

teA" n 

(2.4) < limsupa n logP(v / a^T'„ 6 A) < - inf I(t) 

n teA 

where A denotes the closure of A and A° the interior of A. 

Notice that the moderate deviations principle for {S n / \fn) is an intermediate 
behavior between CLT, F(S n /\/n G A) and large deviation, P(S„/n € A). Our 
moderate deviations results are the following: 

Theorem 4. Let (Xj)j>i be a sequence of centered real valued random variables 
satisfying the assumptions of Theorem 1. Let S n = Y^—i Xj, a 2 = V&r(S n ) and 
assume in addition that liminf„_ ) . 0O o 2 Jn > 0. Then for all positive sequences a n 
with 

(2.5) a n -> and ™°" n — -> 00 

(logn) z (loglogn) / 

the sequence (o~~ 1 S n ) n >i satisfies (2.4) with the good rate function I(t) = t 2 /2. 

If we assume that the sequence is L,2-stationary, then by Lemma 1 in Bradley 
(1997), we get the following corollary: 
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Corollary 5. Let (Xj)j>\ be as in Theorem 4- Suppose in addition that the se- 
quence is L2— stationary and er 2 — > 00. Then, linin^oo o~ 2 /n = a 2 > and for all 
positive sequences a n satisfying (2.5), (n -1 / 2 S n ) n >i satisfies (2.4) with the good 
rate function 1(b) = i 2 /(2er 2 ). 

In the next result, we derive conditions ensuring that the MDP holds for the 
partial sums of triangular arrays of strongly mixing sequences. For a double indexed 
sequence (Xj tU ,j > l) n >i of real valued random variables, we define for any k > 0, 

(2.6) a„(k) = swpa(a(X nti ,i < j),a(X riii ,i > k + j) . 

Theorem 6. For all n > 1, let (Xj n ,j > l)„>i be a double indexed sequence of 
centered real valued random variables such that for every j > 1 and every n > 1, 
||^0,n||oo < M n where M n is a positive number. For all n > 1 and all k > 0, let 
ot n {k) be defined by (2.6) and assume that a(k) = sup n>1 a„(fc) satisfies (1-3). 
Define v 2 by 

(2.7) v 2 = supsup(Var(X i . n ) +2V \Cov(X hn , X 3 . n )\) . 

„>i i>0 V £J > 

and suppose v 2 < 00. Let S n = X)"=i ^i,n 1 °~n = Var(5„) and assume in addition 
that liminf n _i. 0O cr, 2 /ri > 0. Then for all positive sequences a n with 

(2.8) a n — > and na n /M 2 (log n) 4 — > 00 

the sequence (c r ~ 1 S n ) n >i satisfies (2.4) with the good rate function I(t) — t 2 /2. 
3. Discussion and Examples 

1. The first comment is on Theorems 1 and 2. Notice that compared to the tradi- 
tional Bernstein inequality for independent random variables there is a logarithmic 
correction in the linear term in x appearing in the inequalities (2.1) and (2.3). We 
includ in our paper another bound. Corollary 12 gives better results than the other 
exponential bound results in the large deviation range, that is when x is close to 
n. As a matter of fact the tail probability PdS^I > x) can be bounded with the 
minimum of the right hand sides of inequalities (2.1), (2. 3) and (4.16). Among these 
inequalities, (2.1) provides the best condition leading to a moderate deviations 
principle when the random variables arc uniformly bounded. 

2. The strong mixing coefficients are not used in all their strength. For obtaining 
our Bernstein type inequalities we can considerably restrict the class of functions 
used to define the strong mixing coefficients to those functions that are coordinate- 
wise nondecreasing, and one sided relations. Assume that for any index sets Q and 
Q* (sets of natural numbers) such that Q C (0,p] and Q* C [n + p, 00), where n 
and p are integers, there exists a decreasing sequence a*(n) such that 

Cov( f(S Q ),g(S Q *)) < a*(n)||/(5Q)|U||/(5Q.)lloo , 

where / and g are bounded functions coordinatewise nondecreasing. Here Sq = 
^2 ie gXi. Clearly the families of functions exp(^ ?;g g txi) are coordinatewise non- 
decreasing for t > and then, for bounded random variables we have for all t > 

(3.1) Cov(cxp(tS Q ),cxp(tSQ.)) < a >)||cxp(^Q)|U|exp(tS Q .))|U. 
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Also by using the functions f(x) = g(x) = x 

(3.2) Cov(Xj,X i+n ) < a^n^XjW^WXj+nWoo . 

As a matter of fact these are the only functions we use in the proof of our Bernstein 
inequality. So if a*(n) decreases geometrically our results still hold. Inequality (3.1) 
is used to bound the Laplace transform of partial sums, and Inequality (3.2) is used 
to bound their variance. Since both of these inequalities (3.1) and (3.2) are stable 
under convolution we can obtain Bernstein type inequality for example for sequences 
of the type X n = Y n + Z n , where Y n is strongly mixing as in Theorem 1 and is Z n 
a noise, independent on Y n , negatively associated, such as a truncated Gaussian 
sequence with negative correlations. 

We point out that similar results can be obtained by using alternative mixing 
coefficients such as the r-mixing coefficient introduced by Dcdeckcr and Prieur 
(2004). Consequently we can treat all the examples in Merlevede, Peligrad and Rio 
(2009), namely: instantaneous functions of absolutely regular processes, functions 
of linear processes with absolutely regular innovations and ARCH(oo) models. 

3. We now give an application to the moderate deviations principle behavior for 
kernel estimators of the density of a continuous time process. 

Let X = (X t ,t > 0) be a real valued continuous time process with an unknown 
common marginal density /. We wish to estimate / from the data (Xt, < t < T). 
In what follows, we will call a kernel a function K from R to K which is a bounded 
continuous symmetric density with respect to Lebesgue measure and such that 



H 

The kernel density estimator is defined as 



lim uK(u) = 0, and / u 2 K{u)du < oo . 



1 [ T „(x~X t 



where — > + and K is a kernel. In order to derive sufficient conditions ensuring 
that the MDP holds for the sequence y/T(fT(x) — E/t(x)), we assume that there 
exists a constant c > such that for any u > 0, 

(3.3) a u = sup a(a(X s ,s < t),a(X s ,i >u + t)< e~ cu . 

t>o 

In addition, we assume that the joint distribution fx, ,x t between X s and X t exists 
and that fx s ,x t = fx Q ,x, t _ e , ■ Applying Theorem 6, we obtain the following result: 

Corollary 7. Suppose that g u — fx ,x u — f <E> / exists for u ^ 0, and that the 
function u i— > sup^, y \g u (x,y)\ is integrable on ]0,oo[ and <? u (.,.) is continuous at 
(x,x) for each u > 0. In addition assume that the strong mixing coefficients of the 
process satisfy (3.3). Then for all positive sequences ax with 

am a T Th 2 T 

a T -> 0, ► 1 , and —- -> oo , 

a T (logT) 4 

the sequence VT^frix) — E/y(a;)) satisfies (2.4) with speed ax and the good rate 
function 

(3.4) I(t)=t 2 /(4j g u {x,x)d u y 1 . 
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Furthermore if f is differentiable and such that f is l-Lipschitz for a positive con- 
stant I, and if arTh^r — > 0, then the sequence y/T(fx(x) — fi x )) satisfies (2.4) with 
speed or and the good rate function defined by (3.4.). 

Some examples of diffusion processes satisfying Condition (3.3) may be found in 
Vcrctcnnikov (1990) (see also Leblanc, 1997). 

4. Proofs 

First let us comment on the variance of partial sums. By using the notation (1.4), 
for any compact set Ka included in [a, a + A] where A > and a > 0, we have that 

Var(StfJ < AsupfvarpQ) + 2 V |Cov(Xi, Xj)\) . 

If the variables are bounded by M, then by using the definition (1.2), we get that 
Vax(S KA ) < Ail + 8 V a t )M 2 < KAM 2 . 

— 'i>l 

If some degrees of stationarity are available we can have better upper bounds. For 
instance if P(|X n | > x) < F(\X \ > x), then by Theorem 1.1 in Rio (2000), 

Var(5 K J < A(Var(X ) +4^ 4>i EX 2 /(|X | > Q(2a t ))) , 

where Q{u) = inf{t > 0,P(|X o | > t) < u} for u in ]0, 1]. 

4.1. Preliminary lemmas 

The first step is to prove an upper bound on the Laplace transform, valid for small 
values of t. Without restricting the generality it is more convenient to embed the 
index set into continuous time. In the following we shall use the notation (1.4). 

Lemma 8. Let (X n ) n >i be as in Theorem 1. Let B > 2 and a > 0. Then for any 
subset Kb of (a, a + B] which is a finite union of intervals, and for any positive t 

1/2 

with tM < (|) A (2%) , we have 

(4.1) logEexp^&J < i?(6.2tV + ^exp(-^)) , 
where v 2 is defined by (2.2). 

Remark 9. Notice that under our conditions v 2 < KM 2 where K = 1 + 8 oh- 
Proof of Lemma 8. If tM < 4/B, then tSK B — 4, which ensures that 

exp(tS KB ) < 1 + tS KB + ^—^t 2 (S KB ) 2 , 

16 

since the function x H> x~ 2 (e x — x — 1) is increasing. Now (e 4 — 5)/16 < 3.1. Hence 

(4.2) Ecxp(tS I<B ) < 1 + 3.1Bv 2 t 2 , 

which implies Lemma 8 by taking into account that log(l + x) < x. 
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If tM > 4/B, it will be convenient to apply Lemma 15 in Appendix, to get the 
result. Let p be a positive real to be chosen later on. Let k = [B/2p], square brackets 
denoting the integer part. We divide the interval (a, a + B] into 2k consecutive 
intervals of equal size B/{2k). Denote these subintervals by {Ij; 1 < j < 2k} and 
let 

k k 

Si = SK B ni 2] -i and #2 = ^ SK B ni 2j - 

3=1 3=1 

By the Cauchy-Schwarz inequality, 

(4.3) 21ogEexp(^ B ) < logE(2<5i) + logEexp(2i5 2 ). 

Now let p = l/(tM). Since (X n ) n >i satisfies Condition (1.3), and since B/(2k) > 
p > 2, by applying Lemma 15 in Appendix, we obtain 

k 

Ecxp(2t5 2 ) < fccxp(^^ - — ) + JjEexp(2tS KB nJ 2 ;)- 

3=1 

Notice that we are in the case tM > A/B implying that p < B/4 and then k > 2. 
Now, under the assumptions of Lemma 8, we have tM < (cj (2B)) 1 ^ 2 which ensures 
that 

cB c c 

MBt < MBt ~cp< MBt < . 

2k ~ 1 tM ~ 2Mt 

Therefore, 

k 

E(exp(2iS 2 ) <^^cxp(-c/{2tM)) + f[Ec X p(2tS KB ni 2j ). 

3=1 

Since the random variables (Xi)i>i are centered, the Laplace transforms of 5*2 and 
each of {SK B r\i 2 j)j>i are greater than one. Hence applying the inequality 

(4.4) | logx — logy| < |a; — y\ for x > 1 and y > 1 , 
we derive that 

logE(cxp(2^ 2 )) < ^logEcxp(2tS A - B n/ 23 ) + cxp(-c/(2tAf )). 

3=1 

Next \2tS KB m 2j \ < 2tMB/(2k). Since p < B/4, k > B/(4p) implying that 
\2tSK B ni 2j I < 4, and consequently we may repeat the arguments of the proof of 
(4.2), so that 

k 

^logEexp(2£SW B nJ 2j ) < Q.2Bt 2 v 2 . 

3 = 1 

It follows that 

logEexp(2tS , 2 ) < 6.2Bt 2 v 2 + (BMt/2) cxp(— c/(2tM)). 

Clearly the same inequality holds true for the log-Laplace transform of S\ which, 
together with relation (4.3), gives the result, o 

The key lemma for proving our theorems is a new factorization lemma. Its proof 
combines the ideas of Bernstein big and small type argument with a twist, diadic 
recurrence and Cantor set construction. 
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Lemma 10. Let (Xj)j>i be as in Theorem 1. Then, for every A > 2(c V 10) 
there exists a subset Ka of [0, A], with Lebesgue measure larger than A/2 (not 
depending on the random process) such that for all t, < tM < co/(log^4) where 

„ c A /clog 2 

C - 8 A V — a" 

(4.5) log(Eexp(iSW. J) < 6.2u 2 t 2 A + (c + l)^ 1 exp(-c/4tM) . 

w/iere v 2 is defined by (2.2). Moreover, if A > 4 V (2c) /or all < £M < we 
can find a constant C depending only c such that 

(4.6) log(Eexp(i% A] ) < Ct 2 M 2 A\ogA. 

Proof of Lemma 10. The proof is inspired by the construction of a "Cantor set" 
and has several steps. 

Step 1. A "Cantor set" construction. Let A be a strictly positive real number 
strictly more than one. Let S G (0, 1) that will be selected later, and let Ica be 
the largest integer k such that ((1 — 5)/2) k > 1/A. We divide the interval [0,A] 
in three parts and delete the middle one of size AS. The remaining ordered sets 
are denoted Kia, Ki,2 and each has the Lebesgue measure A(l — 8)/2. We repeat 
the procedure. Each of the remaining two intervals K\ t \, K\^%, are divided in three 
parts and the central one of length A5(l — 5)/2 is deleted. After j steps (j < fc^), 
we are left with a disjoint union of 2^ intervals denoted by Kj^, 1 < i < 2 3 , each of 

j'-i 

size A((l - S)/2y and we deleted a total length £ AS (I - S)' 1 = A(l - (1 - S) 3 ). 

i=0 

We use the first index of sets Kjj to denote the step, and second one to denote its 
order. Set k = fc^ when no confusion is allowed, and define 

2" 

(4.7) Ka — [_J K Kl . 

i=i 

We shall use also the following notation: for any I in {0, 1, A^}, 

l= 0-l)2 fc - f + l 

implying that for any I in {0, 1, k}: Ka = Uj=i ^A,e,j- 

Step 2. Proof of Inequality (4.5). Here we consider Ka as constructed in step 
1, with 

Jog2_ 
21ogA ' 

Since A>2, with this selection of 5 we get that 5 < 1/2. Since k(A) < log A/ log 2, 
it follows that 

A([0, A] \ K A ) < ASk(A) < A/2 whence X(K A ) > A/2 . 

We estimate now the Laplace transform of Sk a ■ We first notice that since Ka is 
included in [0, A], then if tM < (j^rj) A y^-, by applying Lemma 8, we derive 
that 

tM 

logEexp^S^J < 6.2At 2 v 2 + A— exp(-c/ (2tM)) . 
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Since tM < c/(81ogj4), we have 

(4.8) exp(-c/(2*M)) < A~ 2 exp(-c/(4*M)) . 
Consequently 

logEcxp(tSWJ < 6.2At 2 v 2 + A- 1 ^ cxpi-c/iAtM)) , 

proving (4.5) since tM < 1/2. Then we assume in the rest of the proof that 
(c/(2^4)) 1 / 2 < tM < cq/ (log A), and we shall then estimate the Laplace trans- 
form of Sk a by the diadic recurrence. Let t he & positive real. Since Ka,i,i and 
Ka,x 2 are spaced by an interval of size A5 and AS > 2 (since A > 20), by using 
Lemma 15 below and condition (1.3), we derive that 

< Ecxp(tS I<A11 )Ecxp(tS I<A12 ) + exp(-cAS + A(1- 5)tM) . 

Since the variables are centered, Eexp(tSK A L J > 1 for i = 1, 2. Hence by taking 
into account (4.4), we obtain that 

(4.9) logEexp(iSW 4 ) < V 2 logEexp(iSV A . .) + exp(-cAS + A(l - 8)tM) . 

— 'i= 1 

Now, let 

(4.10) i = £(t) = inf{fc G Z : A((l 6)/2) k < ^^}- 

Notice that £(t) > 1 since t 2 M 2 > c/(2A). In addition by the selection of /ca and 
since 5 < 1/2 and tM < y^, it follows that £(t) < ftA- Notice also, by the bound 
on tM and since A > 4, we have 

Ad 2'W-i > 2ttMf- Z - 

Using the homogeneity properties of Ka, the decomposition (4.9) and iterating 
until £(t), we get that 

2* 

logEexp(tS KA ) < J2 lo Z Eex v(tS KA , e J 
i=i 

(4.11) + ^ V exp ( - cA8 { ±^- + 2tMA (1 Z^H 

3=0 

Consequently, for any t < cd/(2M), 

cas (i - sy 



2J 



(4.12) logEcxp^J < ^logEexp(t5 KA , J + ^2 J exp 

3=1 3=0 

Whence, since 2 e ^ < A and tM < cS/2 we obtain 

J2 * cxp(-^^-^) < 2*« exp ( - c 2 5/(2tM) 2 ) < Aexp(-c/(2tM)) . 

3=0 
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Now we estimate each of the terms Eexp(tSK A e j)- By the definition of £(t) the 
conditions of Lemma 8 are satisfied for Sk a 1 6 with B = A((l — 5)/2) e ( t \ Conse- 
quently, 

logEaq>{tS KA t J < B(6.2v 2 t 2 + tM exp(-c/(2iM))) . 
Therefore, by using (4.8), we derive that 

logEexp(tS KA ) < eSv^A + tMA' 1 exp(— c/AtM) + A exp(— c/ (AtM)) 
< Q.2v 2 t 2 A + (c+ l)A -1 exp(-c/4iM). 

This ends the proof of Inequality (4.5). 

Proof of Inequality (4.6). The proof of this part uses the same construction 
with the difference that we do not remove the holes from the set and we use instead 
their upper bound. Once again if tM < (c/(2A)) 1 / 2 , applying Lemma 8 together 
with the fact that exp(— c/(2iM)) < 2tM/c and A > 4, we derive that 

logEexp(iS (0iJ 4]) < A\ogA(6.2t 2 v 2 + (tM) 2 /c) , 

Taking into account that v 2 < KM 2 with K = 1 + 8^ i>x ctfj, the inequality (4.6) 
holds true with C > 6.2K + 1/c. Then we can assume without loss of generality 
in the rest of the proof that (c/(2A)) 1/2 < tM < sM. We start by selecting 5 = 
2tM/c < 1. For this (5, we select Iza as before and I = ^(t) as in relation (4.10). At 
first stage we divide as before the interval [0, A] in 3 parts, the central one having a 
Lebesgue measure AS. Notice that AS > 2 since tM > y/ cj (2 A) > c/A by the fact 
that A > 2c. Consequently, since the variables are bounded by M, by condition 

(1.3) , 

Eexp(tS ((M] ) < [Ecxp(tS KAAA )cxp(tS KAA2 )}e tAMS 

< [Eexp(tS , K 4 , 1>1 )IEexp(^ A , 1 , 2 ) + exp(-ASc + AtM)]e tAMS . 

Since the variables are centered, Eexp(tSjf A i J) > 1 for i = 1,2. Hence applying 

(4.4) and recalling that <5 = 2tM/c, we obtain 

logEexp(tS [0 ,A]) < ^ logEexp(t5if A>lii )) + exp(-A(5c/2) + MM<5 . 

Then, we repeat the same procedure starting with Ka,i.i and Ka,i,2, and after 
£ = £(t) iterations we obtain 

logEexp(t5 (0 ,A]) < y"logEexp(^ A ,J 



+ E2.( cxp( - fc «) +i M^ 

i=0 

The above computation is valid since by the definition of £{t) 

(1-SyW- 1 cS 2tM 

2«W-i > 2{tMf ~ 2{tMf ~ ' 

By the above considerations and the selection of £, proceeding as in the proof of 
Inequality (4.5), we obtain 

tM 1 _1 

logEexp(tS ((M ]) < 6-2v 2 t 2 A + A(— + l) exp(-c/(2iM)) + tMSA (1 - <5) J . 
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12 F. Merlevede, M. Peligrad and E. Rio 

Now notice that for the selection 6 = 2tM/c and since I < Ha, we have 

tAMSe < 2t 2 M 2 A\og A/(clog2) . 
Also since for any x > 0, exp(x) > x V (x 2 /2), wc get 

exp(-c/(2£M)) < (2tM/c) A (8t 2 M 2 /c 2 ) . 

Overall 

logEexp(tS( ,A]) < 6.2u 2 i 2 A + t 2 M 2 A(l/c + 8/c 2 ) + 2t 2 M 2 AlogA/(clog2) . 

By taking into account that v 2 < KM 2 we obtain the desired result with the 
constant 

(4.13) C = 6.2K + (1/c + 8/c 2 ) + 2/(clog2) , 

where K = (1 + 8^>i o 

To prepare for the proof of Theorem 1 we shall reformulate the conclusions of 
Lemma 10 in an alternative form. Keeping the same notations as in Lemma 10, the 
following corollary holds. 

Corollary 11. Let [Xi)i>\ be as in Theorem 1. Assume that A > 2(cV 10) and 

< tM < Co /(log A) with c = f A y^p- Then, there is a constant C" depending 
only on c such that 

Assume that A > 2(cV 10) and < tM < (cA l)/2, then for the constant C defined 
in (4.13), 

, , „ , Ct 2 AM 2 log A 

(4.15) lo g E(exp(^ 1 )< 1 _ 2 , M/( J 1) - 

Before proving Theorem 1 we remark that the second part of the above corollary 
already gives a bound on the tail probability with a correction in the quadratic 
term in x. 

Corollary 12. Under conditions of Theorem 1, for all n > 2(c V 2) and x > 0, 

(4.16) P(|5„| > x) < exp ( - n(log „ )4CA / +4Ma;/(cAl) ) , 
where C is defined in (J^.13). 

4-2. Proof of Theorem 1. 



If n < 16(c V 10) 2 then for any positive t such that tM < 4 ^ cV 1 10 ^ wc get that 
\tS n \ < 4. Hence as in the beginning of the proof of Lemma 8, we derive that 

nv 2 t 2 

logE(exp(tS„)) < 3.1 ; ^ . 

& K yy "~ !-4tA/(cVl0) 2 
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We assume now that n > 16(c V 10) 2 . Let us first introduce the following nota- 
tion: for any positive real A, let Ka be the Cantor set as defined in step 1 of the 
proof of Lemma 8, let X(Ka) be the Lebesgue measure of K a, and let Fa be the 
nondccreasing and continuous function from [0,A] onto [0,A — \{Ka)] defined by 

(4.17) F A {t) = A([0, t] n K C A ) for any t £ [0, A], 

where K% = [0,A]\ K A . Let F^ 1 be the inverse function of Fa- Let Aq = n. Define 
then the real-valued process {X[ )t from (X t )te[o,A ] by 

A, (1) = X F - 1(t) for any * e [0, A a - X(K Ao )}- 

Let A\ = A a — X(Ka )- Clearly, the random process (x[ 1 ')te[o.Ai] is uniformly 
bounded by M and verifies (1.5) with the same constant. We now define inductively 
the sequence (Aj)j>o and the random processes (-X"t^)ie[o, J 4 i ] as follows. First Aq — 

n and (X[ ) = (X t ). And second for any nonnegative integer i, Ai+\ = Ai — X{KAi) 
and, for any t in [0, -Aj+i], 

(4.18) X^=X^_ 1{ty 

Then, for any nonnegative integer j, the following decomposition holds 

(4.19) / X u du = V / X®du+ / X^du. 

J0 i=Q JK Ai Jo 

Let 

(4.20) Yi = [ X$du for < i < j - 1 and Zj = [ X^du. 

Jk a . Jo 

Now set 

L = L n = inf{j e N* , Aj < n/(logn)} . 
Notice that, since A,- < n/2 J ', 

(4.21) L<[(logIogn)/(bg2)] + l. 

Also by the definition of L, Al-x > n/(logn). Since logn < 2y/n, it follows that 

A L -i > Vn/2 > 2(cV 10) . 

Hence, we can apply the inequality (4.14) to each Yj for all < j < L n — 1. Conse- 
quently for every < j < L n — 1, and any positive i satisfying < co/(log(n/2 J )), 

(4.22) logE(ex P (^)) < g^Z^ ±M^Z^ . 
1 ^ — l-Mi(Iog(n/2J))/co 

To estimate Zl, we first assume that Al > 2(c V 2). Applying Inequality (4.15) we 
then obtain, for any positive t such that tM < (c A l)/2, 

/ / Ct 2 M 2 n 
logE(exp(tZ L )) < 



1 - 2tM/(cM) ' 
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To aggregate all the contributions, we now apply Lemma 13 of Appendix with Kj = 
M(log(re/2 J '))/co and of = C^') 1 / 2 + (n/2^)- 1 / 2 M) 2 for < i < L-l, a\ = CnM 2 
and kl = 2M/(cA 1). Consequently, by (4.21), there exists C\ depending only on 
c such that 

EL 
, Ki + K L < CiM(logn)(loglogra) . 
i= l 

Furthermore 

^ L _f 1 cr, + ctl < V / C 7 (4un 1/2 + 2M((logn)/n) 1/2 ) + VCnM . 

Hence by Lemma 13, for any n > 4 and any positive i < l/{MCx log n(log log n)) 
there exists C2 depending only on c such that 

logE(exp(^„)) < 



l-*MCi(logn)(loglogn) 



and the result follows. If A L < 2(c V 2), it suffices to notice that if tM < 2/(cV 2), 
then < 4. Hence as in the proof of Lemma 8, we derive that 

4(cV2)¥M 2 

logE(exp(tZ L )) < 3.1: 



T-iM(cV2)/2 



and we proceed as before with kl = M(c V 2)/2 and ctl = 2(c V 2) M. Inequality 
(2.1) follows from the Laplace transform estimate by standard computations, o 

4.3. Proof of Theorem 2. 

We proceed as in the proof of Theorem 1 with the difference that for n > 2(c V 10), 
we choose 

L = L„ = M{j G N* , Aj < 2(c V 10)} . 

Consequently, 

Tog(n)-log(2(cV10)) 



L < 



4-4- Proof of Theorem 4- 



log 2 



The proof is based on the construction of the Cantor-like sets as described in the 
proof of Lemma 10. Let (e„)n>i be a sequence converging to that will be con- 
structed later. Without loss of generality we assume e n < log 2 and define 

(4.23) <5 



logn 

We impose for the moment that e n has to satisfy 
(4.24) S ny /na n — s- 00 . 

(It is always possible to choose such an e n since (2.5) is assumed). Select in addition 
k n = inf {j 6 N* : n ±— ^ < V^}- 
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Construct the intervals Kk n ,i, 1 < i < 2 kn , as in the step 1 of the proof of Lemma 
10. Then R n = (0,n}\K n , with K n = U|_i-Kfc n) i, has a Lebesgue measure smaller 
than 8 n nk n = o(n) and by Inequality (2.1) there exists a constant C depending 
only on c such that 



Cx 2 al 



5 n k n nM 2 + Mx^/a n /a n {\og n) (log log n) 
Taking into account that liminfn^oo a n /n > 0, Condition (2.5) ensures that 
lim a n logF(\S Rn \ > xa n /y/an~) = —oo. 

According to Theorem 4.2.13 in Dembo and Zeitouni (1998), Sn n is negligible for 
the moderate deviations type of behavior. To treat the main part we rewrite the 
inequality (4.11) by using both sides of Lemma 15 from Appendix and the fact that 
for n large enough, (4.24) entails that for any t 

\tM\ (l-^"- 1 >9 

-= < o n / Z and no n nh _^ > o n ^/na n > 2 . 



2fen-i 



(by using also the fact that liminf n a n /n > 0). So, by definition of the sets Ki 



from relation (4.7), we get for any real t and n large enough, 



logEexp ( tSK " ) - V logEcxp ( tOKk 2± ) I < V V cxp( 



ts K ^ 



cn<5 n (1 - S n y 



Now, by the definition of k n notice that 2 fc ™ 1 < y/n/a n . Whence 
fcn-i 



y 2 J exp(-n(5 n ^ 1 ^ ) < 2 k " exp(-^6 ny /na n ) < cxp(-^6 ny /ni 



3=0 



2J 



/TLQ, 2 C 

< 2 exp(— 6 n y/rm^) < — cxp(log( v / ncv) - -S ny /na n ). 

Uiyi CL n Z 

We select now the sequence e n used in the construction of 8 n , in such a way that 



(4.25) 
Denote 



log(y/na n ) - -5 ny /na n -> oo . 



An 



ma,. 



and _B„ 



(logn) 2 (log n) (log log n) ' 

If na„ > (logn) 5 we select e n = A n X ^ 2 ■ If ria„ < (logn,) 5 we take e n = Bn 1 ^ 2 . 
Notice that by construction and by (2.5), e n as n oo and (4.24) is satisfied. 
It is easy to see that 



log(y/ na n ) — cS n y/na n /2 — > — oo when n — > oo. 
Indeed, if na n > (logn,) 5 , then s n = An 1 ^ 2 , so that 

c A 

\og(y/na n ) - cS ny /na n /2 < (logn)/2-- ^-(logn) 2 

lo S" 2A J 

= (logn)(l-c4/ 2 )/2. 
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If na n < (logn) 5 , then s n = B n ', so that 

log( y /na n ~) - c6 ny /na n ~/2 = log(— ^^77) + log[(log n) 5/2 ] 

(log nyl' 1 

-ce n B n (log log n)/2 < (5/2) log log n - cB]j 2 (log log n)/2 . 
Consequently, for any real t, we get 



logEexp (-— ^==) — y^logEexp ( — Kk ™^_ 



tS 1 



as n — > 00 . 



Therefore the proof is reduced to proving the MDP for a triangular array of 
independent random variables S^ k . , 1 < i < 2 kn , each having the same law as 

Sjc k 4 . By the selection of fc„, for any 1 < i < 2 kn , \\SK k ,J|oo < M^Jna n . In 
addition, for each e > 

-T E 2 "n /d^„, I > «W«3) = , 

by using Inequality (2.1) to give an upper bound of P(|5jf fc J > .t). Hence, by 
Lemma 14, we just have to prove that 



(4.26) 



a- 

lim — ^E(& fe ^) 2 = l, 



With this aim we first notice that since Cardi? rl = o(n), it follows that when n — > 00, 
Var(Sij n )/n — > 0. Hence to prove (4.26), it suffices to prove that 



(4.27) 



Var(^ S Kkn .)/ y~] Var(5K fcn t ) — >• 1 as n ->• 00 . 



! = 1 



Between the two sets Kk nt i and Kk n .j, for i ^ j, there is a gap at least equal to 
n<S n 2 1— * n (1 — <5„) fe "~ 1 > 8 n y/na n , by the selection of fc„. Consequently, since for all 
3, \\SK kn Jog < My/na n , by (1.2) we get 
(4.28) 

2 fc "-i 2 krL — 1 

V V . . cov(5jf. 4 , S/c- , ) < 4 x 2 fc "M 2 na n V exp(-cjrJ nV ^') . 

i>i 



Since 2 kn < 2^n/a n , by using (4.25), it follows that 

(4-29) Y, 2 ^ 1 E^-l! CoHS Kkn ,^S Kkn J = o(n) , 

which together with the fact that C\n < Var(^ 2 =1 Sj( k J < Ctfi implies (4.27). 



4-5. Proof of Theorem 6. 

The proof is similar to that of Theorem 4 with the following modifications. Inequal- 
ity (2.3) is used instead of Inequality (2.1) (notice that (2.8) implies n/M 2 — > 00), 
and the sequences e„ (defining S n ) and k n are selected as follows: 

Jna n 

(4.30 e„ -S- and En—^- -> 00 , 

M„(logn) 2 
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and 

k n = inf {j G N* : n(M n V 1) ^ ~J n ^ < 

Notice that with this selection, 2 fc ™ < 2(M„ V l)yn/a„. By the selection of e„, all 
the steps of the previous theorem can be done similarly. Also to prove that 

2 fc " 2 fc " 

Var(^ Sjf hn ti ) j Var {Sx kn ,< ) 1 as n -> oo , 
i=l i=i 

we make use of Condition (2.7) together with the fact that by the selection of k n , 
for all j, \\Sj( k j\\oo < y/na n . The inequality (4.28) becomes 

2 kn — 1 2^ 71 — 1 

E , E . +1 Cov (^^^„J 

* 'l=\ * 

< 8(M„ V l)na„ ^/ nj a n exp(-cj(5 Tt (M K V 1)~ V na ") i 
j>i 

which implies (4.29) by the selection of i5 ra and the fact that M n — o(y/n). 



4-6. Proof of Corollary 7. 

For each n > 1, let us construct the following sequence of triangular arrays: for any 
i 6 Z, 




where n£ = T, n = [T] , (T > 1) and consequently 2 > <5 > 1. Notice that 

n 

J2Xn,i=T(f T (x)-Ef T (x)). 

i=l 

Now for any k > 1, the strong mixing coefficients, a„(fc), of the processes {X n> i)i£% 
are uniformly bounded by the strong mixing coefficient ak-i of the process (X t ,t £ 
R). Hence to apply Theorem 6, it suffices to show (2.7) and to prove that 

n />OG 

T _1 Var( X n ,ij — > 2 / .^(.t, a;)dw as n — > oo . 
i=i - 70 

The above convergence was proved by Castellana and Lcadbctter (1986) under 
assumptions on g u . To prove (2.7), we first notice that for all j > i, 

Cov(Xi ;fl ,X jtn ) = — L- / K (—[—^) K (—[—) [ [ gt- s {y,z)dsdtdydz . 
° h T JR 2 v h T J \ h T ) J iS _ 5 J jS _ s 

Consequently, since K is a kernel, for all j > i, 

Aj-i+i)5 

\Cov{Xi, n ,Xj, n )\ < / sup \g u (x,y)\du. 

J{j—i—l)t> X ,V 
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Similarly 

Var(X 4> „) < 2 / sup \g u (x, y)\du . 

JO x,y 

Hence (2.7) holds with 

v 2 < 2 / sup \g u (x,y)\du + 4 / sup \g u (x, y)\du . 

JO x,y Jo x,y 

To finish the proof, it remains to notice that if / is diffcrcntiable and such that /' 
is i-Lipschitz for a positive constant I then, since K is a kernel, 

\Ef T (x)-f(x)\=0(h 2 T ), 

(see for instance relation 4.15 in Bosq (1998)). 



5. Appendix. 

One of our tools is the technical lemma below, which provides bounds for the 
log-Laplace transform of any sum of real-valued random variables. It comes from 
Lemma 3 in Merlevede, Peligrad and Rio (2009). 

Lemma 13. Let Zq, Z\, ... be a sequence of real valued random variables. Assume 
that there exists positive constants o~o, <7i, . . . and Kq, K\, . . . such that, for any i > 
and any t in [0, 1/q[, 

IogEexp(«Zi) < M) 2 /(l - Kit) . 
Then, for any positive n and any t in [0, 1/(kq + «! + ••• + K n )[> 

logEcxp(i(Z + Zi + • • • + Z n )) < (<7t) 2 /(l - Kt), 
where a = oo + o~\ + • • • + a n and n = Ko + K\ + • • • + K n . 

The next lemma is due to Arconcs (Lemma 2.3, 2003) and it permits us to derive 
the MDP for triangular array of independent r.v.'s. 

Lemma 14 (Arcones (2003)). Let {X n j; 1 < j < k n } be a triangular array of 
independent r.v.'s with mean zero. Let {a n } n >i be a sequence of real numbers con- 
verging to 0. Suppose that: 

(i) The following limit exists and is finite: 

hm ]T fen E(Xl j ) = a\ 

n— foo *- — '3 — 1 

(ii) There exists a constant C such that for each 1 < j < k n , 



(Hi) For each e > 




Then for all real t, a n JZj—% logEexp(tX n j) — > t 2 a 2 /2 and consequently the MDP 
holds for (X^j=i Xj. n ) with speed a n and good rate function I(t) = t 2 /{2a 2 ). 
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We first recall the following lemma, which is a well-known corollary of Ibragi- 
mov's covariance inequality for nonnegative and bounded random variables. 

Lemma 15 (Ibragimov (1962)). Let Z\, .... Z p be real-valued nonnegative random 
variables each a.s. bounded, and let 

a = sup a(a(Zi : i < k),a(Zi : i > k)) 
fee[i,p] 

Then 

E(Zi ...Z p )< E(Z X ) . . .E(Z P ) + (p - l)a||^i|U ■ ■ ■ II^IU , 

and 

) <E(Z 1 ...Z p ) + (p-l)a\\Z 1 \\ 

oo ' ' ' I 1 1 oo ■ 
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